Make curve fitting faster

Marek Dostal on 24 Nov 2018
Edited: Bruno Luong on 10 Nov 2022
Hi,
I would like to make my curve fitting faster by using the GPU. Is it possible? If so, how?
Thanks
  1 Comment
Matt J on 25 Mar 2019
Marek's problem description copied here for reference:
I have MRI data of the brain (each subject has approx. 0.5 million voxels) with 10 data points per voxel, and in each voxel I want to fit these 10 points with a specific biexponential curve (so that I obtain parametric maps of the brain). I tried to optimize the script before using parallel computing (1 brain ~4 hours), but so far I have only used parfor (~1.25 hours). Because I am still a beginner with MATLAB, I was curious whether other users have experience with GPUs and could help me make this faster than parfor.
Thanks
Marek


Accepted Answer

Matt J on 26 Nov 2018
Edited: Matt J on 21 Jun 2019
In each voxel I want to fit these 10 points with a specific biexponential curve (so that I obtain parametric maps of the brain)
@Marek,
Do not fit one voxel at a time in a loop. Instead, write the model as an N-dimensional curve where N is the number of voxels and apply lsqcurvefit to this model. Do not let lsqcurvefit use its default finite difference derivative calculations. Instead, supply your own Jacobian*x operator using the JacobianMultiplyFcn option (EDIT: or provide a sparse block diagonal Jacobian matrix computation).
Your problem is a good example of what I was discussing with Joss: both the model function and the JacobianMultiplyFcn could be GPU-optimized if lsqcurvefit could work with gpuArray variables without constantly doing GPU/CPU transfers. However, I expect the fit will still be reasonably fast if you use appropriate Matlab vectorization techniques.
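To make the idea concrete, here is a minimal sketch (not Marek's exact model or data) of fitting K voxels in a single lsqcurvefit call. It assumes a hypothetical biexponential form y = a*exp(-b*t) + c*exp(-d*t) with 4 parameters per voxel, and supplies the sparse block-diagonal Jacobian so no finite differencing is needed; biexpAll is an illustrative helper name, not something from the toolbox:
% Minimal sketch (hypothetical model and data): fit K voxels in one call,
% 4 parameters per voxel, with a sparse block-diagonal Jacobian.
K  = 1000;                        % number of voxels (~5e5 in the real problem)
t  = linspace(0, 1, 10).';        % 10 sample points per voxel
Y  = rand(10, K);                 % measured data, one column per voxel
p0 = repmat([1; 1; 1; 1], 1, K);  % initial guess, columns are [a; b; c; d]
opts = optimoptions('lsqcurvefit', 'SpecifyObjectiveGradient', true);
p = lsqcurvefit(@(p,tt) biexpAll(p, tt, K), p0, t, Y, [], [], opts);
function [F, J] = biexpAll(p, t, K)
a = p(1,:); b = p(2,:); c = p(3,:); d = p(4,:);
E1 = exp(-t*b);                   % 10-by-K via the outer product t*b
E2 = exp(-t*d);
F  = a.*E1 + c.*E2;               % same size as Y
if nargout > 1
    % Each voxel's 10 residuals depend only on its own 4 parameters,
    % so the Jacobian of F(:) with respect to p(:) is block diagonal.
    n    = numel(t);
    rows = repmat(reshape(1:n*K, n, K), 4, 1);
    cols = repelem(reshape(1:4*K, 4, K), n, 1);
    vals = [E1; -a.*(t.*E1); E2; -c.*(t.*E2)];
    J    = sparse(rows(:), cols(:), vals(:), n*K, 4*K);
end
end
The same pattern works with a JacobianMultiplyFcn instead of an explicit sparse J; with blocks as small as 10-by-4 the explicit sparse form is usually the simpler option.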
  16 Comments
Matt J on 11 Apr 2019
Edited: Matt J on 11 Apr 2019
1) The error is larger when the Jacobian multiply function is supplied. I do not think there is an error in the derivative or in the indexing.
Assuming you're right, I would say the resnorm differences seen here are just "stopping noise": the iterations of the different computation methods take different paths toward the solution and stop wherever the stopping tolerances happen to be satisfied. The resnorm is not uniform over the range of these different possible stopping points, so you see essentially random differences in the resnorm between methods. Of course, it may be worth taking a look at the exitflag in each case...
In any event, it doesn't appear that JacobMult was ever capable of doing better in terms of accuracy than the default method. The worry that SpecifyObjectiveGradient tries to address, aside from speed, is that the default finite difference calculations may limit the accuracy of the fit in some cases. But your tests show that wasn't the case here. The finite-difference derivative calculations were good enough to reach an essentially zero resnorm, so there was no real room for improvement.
2) The time for the Jacobian multiply is slower than for supplying the exact sparse Jacobian.
Your Jacobian multiply code has various inefficiencies in it: for-looping, quantities like fp(:).*qp(:) or xp(:)-p(1) that are computed multiple times per k when they could be computed only once, subsref operations like fp=f(ff,:) which force repeated memory allocations, etc. (a rough loop-free sketch is given at the end of this comment).
3) What would you recommend for N fits that do not have the same number of data points?
You may find that it is not an issue once you start optimizing your computations and getting rid of your for-loops.
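For reference, here is a rough loop-free pattern for the Jacobian multiply itself. It assumes (hypothetically) that the n-by-p derivative blocks of all K fits have been precomputed into an n-by-p-by-K array D, with residuals and parameters stacked fit after fit rather than interleaved across fits as in your code, so the reshapes would need adjusting for your layout; in practice D would be passed in via Jinfo or captured in an anonymous function, and pagemtimes requires R2020b or later:
% Sketch of a loop-free Jacobian multiply for a block-diagonal Jacobian,
% derivative blocks stored in the n-by-p-by-K array D.
function w = jmfcn(D, Y, flag)
[n, p, K] = size(D);
m = size(Y, 2);            % lsqcurvefit may pass Y with several columns
if flag > 0                % w = J*Y
    Yk = permute(reshape(Y, p, K, m), [1 3 2]);            % p-by-m-by-K
    w  = reshape(permute(pagemtimes(D, Yk), [1 3 2]), n*K, m);
elseif flag < 0            % w = J'*Y
    Yk = permute(reshape(Y, n, K, m), [1 3 2]);            % n-by-m-by-K
    w  = reshape(permute(pagemtimes(D, 'transpose', Yk, 'none'), [1 3 2]), p*K, m);
else                       % w = J'*(J*Y)
    w  = jmfcn(D, jmfcn(D, Y, 1), -1);
end
end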
sarah goldberg on 6 May 2019
Hi Matt,
I've been trying to generalize the N-D Jacobian solution, comparing three variants: Jacobian not calculated, Jacobian calculated explicitly, and only a Jacobian multiply function supplied. The Jacobian multiply option is currently not working.
Here k is the number of 1-D fits, each with n points and p parameters, so N = n x k and P = p x k. The Jacobian size is indeed (N x P) - I checked the output of lsqcurvefit.
I'm running into the following issue with the Jacobian multiply function (myJ): the size of the input vector Y is either (Nx1) or (Px1) or (Px2). How is this possible?
Code below. I am simply looking at
size(Y)
Thanks,
Sarah
clear all; close all; fclose all; clc;
s=rng('shuffle');
rng(828016308,'twister');
global history;
history=[];
% plot
doPlot=false; %true;
num_plot=4; % how many fits to plot
% fit params
alg='trust-region-reflective' ; % 'trust-region-reflective' or 'levenberg-marquardt'
displaySetting='off'; % 'iter', 'off', 'final', etc.
maxIter=1e5;
maxFunEvals=1e5;
tolfun=1e-6;
% options
opts = optimoptions('lsqcurvefit','MaxIter',maxIter,'MaxFunEvals',maxFunEvals,'OutputFcn', @myoutput,'display',displaySetting,'Algorithm',alg,'tolfun',tolfun);
optsJ = optimoptions('lsqcurvefit','MaxIter',maxIter,'MaxFunEvals',maxFunEvals,'OutputFcn', @myoutput,'display',displaySetting,'Algorithm',alg,'tolfun',tolfun,'Jacobian','on');
optsJm = optimoptions('lsqcurvefit','MaxIter',maxIter,'MaxFunEvals',maxFunEvals,'OutputFcn', @myoutput,'display',displaySetting,'Algorithm',alg,'tolfun',tolfun,'SpecifyObjectiveGradient',true,'JacobianMultiplyFcn',@(Jinfo,Y,flag,xdata)myJ(Jinfo,Y,flag));
% THE MODEL. generate function handles for evaluating f (f_func) and its partial derivatives (dfdp_func).
[f_func,dfdp_func,num_params]=sym_model_derivatives();
% general 2D model
num_g=2; % number of gaussians
npoints=[3,5]; % image size [r,c]=[y,x], ROI intentionally not symmetric for debugging
min_width=1; % so we don't have sigma==0
a0=30;
a_std=1;
mu_std=0.1;
sigma0=[2,1]; % be careful, do not make these too small!
sigma_std=0.1;
theta=0*(pi/180); % in radians
theta_std=10*(pi/180);
bg=10;
bg_std=5;
mu0=npoints/2; % place gaussians roughly at center
im=zeros(npoints);
% generate data: num_g images of size(im)
ims=zeros(num_g,size(im,1),size(im,2)); % image set size
mus=mu0'*ones(1,num_g) + mu_std*randn(2,num_g); % 1 per dimension
sigmas=sigma0'*ones(1,num_g) + sigma_std*randn(2,num_g); % 1 per dimension
as=a0 + a_std*randn(1,num_g); % 1 per gaussian
sigmas(sigmas<0.1)=min_width; % to avoid division by 0
thetas=theta+theta_std*randn(1,num_g);
bgs=bg+bg_std*randn(1,num_g);
[X,Y]=meshgrid(1:size(im,2),1:size(im,1)); % x-y coordinates - weird order so that data can be reshaped correctly
x_ind=X(:); % x, column vector
y_ind=Y(:); % y, column vector
for mm=1:num_g
% parameter order: %Amplitude % Row (y) % Col (x) % Theta -3* 3* % Sigma_row (y) % Sigma_col (x) % Background
c0s(mm,:)=[as(mm) mus(2,mm) mus(1,mm) thetas(mm) sigmas(2,mm) sigmas(1,mm) bgs(mm)];
ims(mm,:)=model_func(c0s(mm,:),x_ind,y_ind,f_func); % 1D vector per 2D plot
end
% guesses
guesses=zeros(num_g,num_params);
% parameter order: %Amplitude % Row % Col % Theta -3* 3* % Sigma_row % Sigma_col % Background
for mm=1:num_g
[maxval,~]=max(ims(mm,:));
tmp_im=reshape(ims(mm,:)',size(im,1),size(im,2));
[xpos,ypos]=find(tmp_im==maxval);
[~,half_x]=min(abs(tmp_im(ypos,:) - maxval/2));
[~,half_y]=min(abs(tmp_im(:,xpos) - maxval/2));
guesses(mm,:)=[maxval xpos ypos 0 (abs(half_x(1)-xpos)+min_width) (abs(half_y(1)-ypos)+min_width) 0]; % adding something small to avoid 0
end
%% N x 2D - setup
im_data=zeros(num_g,prod(size(im)));
xdata=kron(ones(1,num_g),x_ind)';
ydata=kron(ones(1,num_g),y_ind)';
guesses_ND=zeros(num_g,num_params);
% images
for mm=1:num_g
im_data(mm,:)=ims(mm,:);
end
% guesses
guesses_ND=guesses;
%% N x 2D
history=[];
results_ND=zeros(num_g,num_params);
fprintf(1,['\n']);
tic
f_ND = @(c,xdata)model_func_ND(c,xdata,ydata,f_func); % need to pass x_ind, y_ind to model_func
[results_ND,resnormN,residualN,exitflagN,outputN,lambdaN,jacobianN]=lsqcurvefit(f_ND,guesses_ND,xdata,im_data,[],[],opts); % each row is 1 object
toc
fprintf(1,['Nx2D\n']);
resnormN
historyN=history;
exitflagN
%% N x 2D + Jacobian
results_ND_J=zeros(num_g,num_params);
fprintf(1,['\n']);
tic
f_ND = @(c,xdata)model_func_ND_J(c,xdata,ydata,f_func,dfdp_func); % need to pass args x_ind, y_ind, etc. to model_func
[results_ND_J,resnormN_J,residualN_J,exitflagN_J,outputN_J,lambdaN_J,jacobianN_J]=lsqcurvefit(f_ND,guesses_ND,xdata,im_data,[],[],optsJ); % each row is 1 object
toc
fprintf(1,['Nx2D + Jacobian\n']);
resnormN_J
exitflagN_J
%% N x 2D + Jacobian multiply
history=[];
results_ND_J=zeros(num_g,num_params);
fprintf(1,['\n']);
tic
f_ND = @(c,xdata)model_func_ND_Jm(c,xdata,ydata,f_func,dfdp_func); % need to pass args x_ind, y_ind, etc. to model_func
[results_ND_Jm,resnormN_Jm,residualN_Jm,exitflagN_Jm,outputN_Jm,lambdaN_Jm,jacobianN_Jm]=lsqcurvefit(f_ND,guesses_ND,xdata,im_data,[],[],optsJm); % each row is 1 object
toc
fprintf(1,['Nx2D + Jacobian multiply\n']);
resnormN_Jm
historyNm=history;
exitflagN_Jm
return
%% ----- functions -------
% model:
% f = Amplitude*exp(- ( A*(x-Col).^2 + 2*B*(x-Col).*(y-Row) + C*(y-Row).^2 ) ) + Background
% A = cos(Theta)^2/(2*Sigma_col^2) + sin(Theta)^2/(2*Sigma_row^2)
% B = sin(2*Theta) * (1/Sigma_row^2 - 1/Sigma_col^2)/4;
% C = sin(Theta)^2/(2*Sigma_col^2) + cos(Theta)^2/(2*Sigma_row^2);
% parameter order: %Amplitude % Row % Col % Theta -3* 3* % Sigma_row % Sigma_col % Background
function [f_func,df,num_params]=sym_model_derivatives()
num_params=7;
p=sym('p',[1 num_params]);
syms f x y A B C tmp_df;
% model definition HERE
A=cos(p(4))^2/(2*p(6)^2) + sin(p(4))^2/(2*p(5)^2);
B=sin(2*p(4)) * (1/p(5)^2 - 1/p(6)^2)/4;
C=sin(p(4))^2/(2*p(6)^2) + cos(p(4))^2/(2*p(5)^2);
f = p(1)*exp(- ( A*(x-p(3))^2 + 2*B*(x-p(3))*(y-p(2)) + C*(y-p(2))^2 ) ) + p(7);
% end of model definition
vars=[p x y];
for ii=1:length(p)
tmp_df=diff(f,p(ii));
% convert dfdp to matlab function
df{ii}=matlabFunction(tmp_df,'vars',vars);
end
% convert f to matlab function
f_func=matlabFunction(f,'vars',vars);
end
% 2D model - used to generate data
function [f]=model_func(p,x,y,f_func)
p_list=num2cell(p); % to avoid p(1),p(2),...,p(n) argument passing
f=f_func(p_list{:},x,y);
end
% Nx2D model
function [f]=model_func_ND(c,x,y,f_func)
G_n=size(x,2); % number of data points
G_k=size(x,1); % number of fits
f = zeros(G_k, G_n);
for ff=1:G_k
xp=x(ff,:); % points for this fit (n)
yp=y(ff,:);
cp=c(ff,:); % params for this fit (m)
cp_list=num2cell(cp);
% f
f(ff,:)=f_func(cp_list{:},xp,yp);
end
end
% Nx2D + Jinfo model
function [f,Jinfo]=model_func_ND_info(c,x,y,f_func)
G_n=size(x,2); % number of data points
G_k=size(x,1); % number of fits
f = zeros(G_k, G_n);
for ff=1:G_k
xp=x(ff,:); % points for this fit (n)
yp=y(ff,:);
cp=c(ff,:); % params for this fit (m)
cp_list=num2cell(cp);
% f
f(ff,:)=f_func(cp_list{:},xp,yp);
end
G_p=numel(c)/G_k; % number of parameters per fit
Jinfo = spalloc(G_n*G_k, G_p*G_k, 0);
end
% Nx2D + Jacobian
function [f,J]=model_func_ND_J(c,x,y,f_func,dfdp_func)
G_n=size(x,2); % number of data points
G_k=size(x,1); % number of fits
G_p=numel(dfdp_func); % number of parameters per fit
f = zeros(G_k, G_n);
J = spalloc(G_n*G_k, G_p*G_k, G_n*G_k*G_p);
% calculate fit values and partial derivatives for each fit
for ff=1:G_k
xp=x(ff,:); % points for this fit (n)
yp=y(ff,:);
cp=c(ff,:); % params for this fit (m)
cp_list=num2cell(cp);
% f
f(ff,:)=f_func(cp_list{:},xp,yp);
% J
for pp=1:length(dfdp_func)
df=dfdp_func{pp}; % function handle of this partial derivative
J(ff:G_k:end , (pp-1)*G_k + ff) = df(cp_list{:},xp,yp);
% full(J)
end
%size(J)
end
end
% Nx2D + Jacobian multiply
function [f,Jinfo] = model_func_ND_Jm(c,xdata,ydata,f_func,dfdp_func)
global G_df_func G_P G_n G_k G_xd G_yd; % needed for jacobian multiply function
G_n=size(xdata,2); % number of data points
G_k=size(xdata,1); % number of fits
f = zeros(G_k, G_n);
for ff=1:G_k
xp=xdata(ff,:); % points for this fit (n)
yp=ydata(ff,:);
cp=c(ff,:); % params for this fit (m)
cp_list=num2cell(cp);
% f
f(ff,:)=f_func(cp_list{:},xp,yp);
end
% update globals
G_P=c; % parameters
G_p=numel(dfdp_func);% number of parameters per fit
G_xd=xdata; % indices
G_yd=ydata;
G_df_func=dfdp_func; % partial derivative function handles
% Jinfo sparse matrix for preconditioning, required even when jacobian
% multiply is used. size should equal the size of the jacobian.
Jinfo = spalloc(G_n*G_k, G_p*G_k, G_n*G_k*G_p); % all zeros
end
% compute Jacobian multiply functions.
% note: Jacobian C is sparse. the only nonzero elements are
% derivatives of points xi belonging to fit j
% with respect to their own fitting parameters. Their positions within the
% jacobian can be viewed by looking at the jacobian generated in model_func_ND_J
% using the command full(J).
function w = myJ(Jinfo,Y,flag)
size(Y) %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
global G_df_func G_P G_n G_k G_xd G_yd;
P=G_P;
n=G_n;
k=G_k;
xd=G_xd;
yd=G_yd;
df_func=G_df_func;
if flag > 0
w = Jpositive(Y, df_func,P,n,k,xd,yd);
elseif flag < 0
w = Jnegative(Y, df_func,P,n,k,xd,yd);
else
w1= Jpositive(Y, df_func,P,n,k,xd,yd);
w = Jnegative(w1,df_func,P,n,k,xd,yd);
end
function a = Jpositive(q,df_func,P,n,k,xd,yd)
% Calculate C*q (dimensions NxP, Px1 => Nx1)
a = zeros(n*k,1);
for ff=1:k % loop over fits, calculate n sums for each fit
xp=xd(ff,:); % points for this fit (n)
yp=yd(ff,:);
p=P(ff,:); % params for this fit (m)
qp=q(ff:k:end); % values of q for this fit (m)
sump=zeros(1,n);
for pp=1:numel(df_func)
df=df_func{pp}; % 1 of m function handles
p_list=num2cell(p);
dfp=df(p_list{:},xp,yp); % n partial derivatives (may have 1 value if derivative is a const)
sump=sump+qp(pp)*dfp; % add n terms for each of m params
end
a(ff:k:end)=sump'; % n terms
end
end
function a = Jnegative(q,df_func,P,n,k,xd,yd)
% Calculate C'*q (dimensions PxN, Nx1 => Px1)
a = zeros(prod(size(P)),1); % Px1
for ff=1:k % loop over fits, calculate 3 sums for 3 fit parameters
xp=xd(ff,:); % points for this fit (n)
yp=yd(ff,:);
p=P(ff,:); % params for this fit (m)
qp=q(ff:k:end); % part of input vector q that contributes for this fit (n)
for pp=1:numel(df_func)
df=df_func{pp}; % 1 of m function handles
p_list=num2cell(p);
dfp=df(p_list{:},xp,yp); % n partial derivatives
a(ff+(pp-1)*k)=sum(qp'.*dfp);
end
end
end
end
% append history if state=='iter', useful for checking convergence
function stop = myoutput(x,optimvalues,state)
global history;
stop = false;
if isequal(state,'iter')
history = [history; x];
end
end


More Answers (1)

Joss Knight on 24 Nov 2018
Edited: Joss Knight on 24 Nov 2018
It does rather depend on what you're doing. The functions polyfit and interp1 work with gpuArray inputs.
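For example, a polynomial fit runs on the GPU simply by passing gpuArray data (a minimal sketch with synthetic data; requires Parallel Computing Toolbox and a supported GPU):
x = gpuArray.linspace(0, 1, 1e6);
y = 2*x.^3 - x + 0.01*randn(1, 1e6, 'gpuArray');   % noisy data generated on the GPU
p = polyfit(x, y, 3);                              % coefficients computed on the GPU
p = gather(p);                                     % bring the result back to the CPU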
  14 Comments
Emiliano Rosso on 10 Nov 2022
Can we modify the MATLAB code, or is it p-coded?
I mean, to step through the routine and eliminate the errors, type mismatches, etc.
Thanks!
Bruno Luong on 10 Nov 2022
Edited: Bruno Luong on 10 Nov 2022
Just to chime in on the discussion about Matt's request to enable the optimization functions on the GPU.
I think the feature would be most interesting for people working on (2D/3D) image processing: denoising and deblurring, where the number of parameters is the number of pixels. The cost function is a sum of local operators, so it is very well suited to the GPU architecture.
If data transfer between the CPU and GPU is NOT required at each iteration or each gradient computation, the run time will benefit.
There might be a new class of optimization methods still being developed by researchers, but I think one should think about better exploiting the GPU architecture in TMW's Optimization Toolbox.
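As a small illustration of the kind of objective Bruno describes, a denoising cost built from local operators can be evaluated entirely on the GPU (a hypothetical sketch; today the result would still have to be gathered before being handed to an Optimization Toolbox solver):
y      = gpuArray(rand(512));                 % noisy image kept on the GPU
lambda = 0.1;
L      = [0 -1 0; -1 4 -1; 0 -1 0];           % discrete Laplacian stencil (local operator)
cost = @(x) 0.5*sum((x - y).^2, 'all') + ...
       0.5*lambda*sum(conv2(x, L, 'same').^2, 'all');
grad = @(x) (x - y) + lambda*conv2(conv2(x, L, 'same'), L, 'same');
% (the stencil is symmetric, so it is its own adjoint; boundary effects
%  are ignored in this sketch)
f = gather(cost(y));                          % only the scalar cost crosses back to the CPU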

