nonlinear fit of experimental data

Question

0 votes

Dear MatLab Experts,

I would like to generate a nonlinear regression model to fit my experimental data 'Mk_Superf_FSF' as function of the independent variables 'MaxFDiam' and 'MinFDiam' which are respectively the max and min diameter of an arbirarily shaped closed and connected 2D surface. I also added the variable 'Area' which is obviously correlated to max and min diameters so I think it is not wise to use that as well.

I was suggested a linear fit for the experimental data (see attached picture). The 7th order polynomial p(x) fits the data very well but the suggested formula is non physical. In fact, the variable used is a sum of quantities with different units:

x = MaxFDiam * MinFDiam + Area / MaxFDiam + Area / MinFDiam + MaxFDiam / MinFDiam

I cannot assign a units to ithe resulting sum because the product of the two diameters has units [mm^2.] whereas the Area/MaxDIameter has units [mm], the ratio of the two diamters is unitless.

I tried to fit a sum of two negative exponentials where in the exponent I have the Area ad the product of the diameters respectively. MatLab complained printing out that the Jacobian has a column of all zeros. I tried some other combinations of exponential functions. Again MatLab complained stating that the model returns "NaN" of "Infinity".

Some other times MatLab printed out that that maximum number of iterations had been exceeded.

I tried a power-law fit as follows:

coeffs0 = [0.8672 1 1]

opts = statset('fitnlm');

opts.RobustWgtFcn = 'bisquare';

X = [MaxFDiam' MinFDiam'];

mdlfun = @(coeff, X) coeff(1)* X(:,1).*X(:,2).^coeff(2) + coeff(3);

mdl = fitnlm(X,Mk_Superf_FSF',mdlfun, coeffs0,'Options', opts, 'CoefficientNames', {'a' , 'b', 'c'});

This time MatLab did not complain but the resulting model is anything but good. The R^2 value is awful. The P_values are very high except for one.

mdl =

Nonlinear regression model:

y ~ a*x1*x2^b + c

Estimated Coefficients:

Estimate SE tStat pValue

________ ________ ________ __________

a 0.007407 0.02121 0.34922 0.73352

b -0.26657 0.87603 -0.30429 0.76658

c 0.93603 0.048648 19.241 8.0919e-10

Number of observations: 14, Error degrees of freedom: 11

Root Mean Squared Error: 0.0363

R-Squared: 0.215, Adjusted R-Squared 0.0721

F-statistic vs. constant model: 1.51, p-value = 0.264

Maybe the model is not right. Maybe the initial parameter values are not good.....

I would greatly appreciate some help at getting a decent fit. Above all, I would like to learn techniques to:

(1) devise the model formula

(2) choose the initial parameter values

Thank you so much for any suggestion and help.

Best regards,

Maura E. M.

14 Comments
Show 12 older comments Hide 12 older comments

dpb on 19 Aug 2019

Whassup w/the first observation? Looks like complete outlier.

The response versus min diameter also looks peculiar w/ a couple of points in the middle that are very far out of line with adjacent before/after.

The response versus the area variable is far more well behaved than either of the diameter variables with again, the exception of the first point just isn't even close to the rest.

I'm not at all surprised the model doesn't fit well...you talk of the linear combination factors model not being physical; what does the system represent and is there any physical correlation that it should follow to guide the model? That's always the best thing to have if there is anything one can use.

As far as units on the other expression, while it's not the question you asked nor necessarily the best way to fit, one simply assigns units to the coefficients as needed to match the independent variables such that the fitted response does have the right units. Granted, the resulting coefficients may have no real physical interpretation, but you can make the units work arbitrarily as well as the choice of terms in the fit.

Maura E. Monville on 20 Aug 2019

Custom-Made-Collimator-On-Applicator.jpg

The data I posted are dose measurements of the Field Size Factor (FSF) carried out by a Markus ion chamber.

It is stated in the IAEA TRS-398 Code Of Practice that the field size must be at least twice the transversal size of the sensitive volume of the detector to get a meaningful reading. That's why the 1st observation is indeed an outlier. In fact the field diameter is 6[mm]. Therefore it violates the TRS-398 recommendations. That is, fields whose size is less than 1[cm] require a different detector. The Markus is not the right detector in this case.

What defines the field size is the aperture of a patient-specific collimator (see attached example of a collimator mounted on the applicator). Such collimators are custom-made to conform the radiation beam to the shape of the target (that is the ocular tumour). They are used to treat eye cancers with proton therapy. The length scale of these devices is the [mm].

The Area is obviously calculated from the polygonal that defines the aperture rim.

I extracted some other features like the Max and Min Feret diameters, excentricity, circularity, bounding box, and so on. There is no dependence on any of them but the diameters.

The goal is to find out if the measured FSF depends on the size and shape of the specific collimator. Staring at the measurements, it seems that there is no such dependence if we accept an error of about 0.2%..

The data I provided reresent the FSF for 14 different patient-specific collimators.

The difference among the FSF is within 1% with the exception of the outlier.

We are still looking for explaining the difference we measured.

You said the Area fits better than the product of the diameters. Did you use a power-law model?

Is there abetter model you suggest?

Thank you.

Regards

dpb on 20 Aug 2019

Edited: dpb on 21 Aug 2019

I hadn't yet fitted anything; I was just exploring the dataset by plotting various ways...fitting blindly w/o visualizing first is fools' errand. I plotted against each of the independent variables (after sorting by the variable) and those were enough to make me ask some questions before trying to go further...here are a couple of the plots--

As noted, these are plotted by sorting on the independent variable and using the sorted index for the response variable. What seems peculiar with the min diameter is the more jagged and the drop in the response for the two cases around 11-12. That is a very difficult detail to fit with precision and just raises questions as to whether is or is not an artifact of the measurement or real.

This is an even more detail look than I had done last night--what this shows is that the max diam and the area are nearly surrogates for each other altho one must remember in these "one at a time" plots the order isn't quite identical; this was done simply to visualize whether there was an apparent correlation with the independent variable of the desired predicted response.

I don't know the definition of the FSF nor how much precision can be presumed to be associated with the observation but with area there would appear to be a peak then a somewhat exponential decrease as area increases. That is pretty-much the gross shape with the two "diameters" and, of course, the area is going to be a function of those albeit given the arbitrary shape there's no direct simple relation there.

I suspect the problem here is that arbitrariness in the shape -- and even if you developed an almost perfect correlation from these data there would be no reason to believe it would hold for another set of observations for which the shapes weren't the same or very similar. Possibly it is that there is a unique feature there that distinguishes the two "funny" cases with the minimum diameters that isn't present in the rest of the samples.

I would wonder if other measures of geometry that try to represent the shapes in more categorical terms might produce better predictors -- like measures of curvature or lobes or such--maybe measures of perimeter might be an indicatior of that difference from being just a circular opening, who knows. I'd probably study the outlines of those shapes against the response and see if I could pick out any pattern that seemed to correlate...

Maura E. Monville on 21 Aug 2019

Thank you so much for your deeply enlightening remarks.

I tried to fit a linear regression model after removing the 1st observation which is a physical outlier. I did not come up with any physically meaningful models.

I forgot to point out that the non-physical linear model cannot be made physically meaningful by assigning proper units to the model coefficients. Infact the model finds coefficients for powers of the varialbe "x" which itself cannot be assigned any units because it is a sum of terms whose units are different. Just have a look at the picture I sent previously.

I have not tried yet a nonlinear model after removing the outlier.

I can try. Do you think it will make a big difference?

By the way, I have plotted together all the measurements carried out with the Markus ion chamber, regardless of the SOBP used (attached plot). My conclusion, staring at the composite plot, is that the measurements for the Intermediate and Superficial SOBP are very similar. They almost coincide. This may be due to a bias in our choice of representative SOBPs (see picture with the 3 SOBPs).

SOBP = Spread Out Bragg Peak.

It represents the treated tumour depth in the direction of the proton beam.

FSF is a ratio of doses.

FSF = (Dose delivered at the SOBP center by a collimator) / (Dose delivered by a reference collimator)

The Reference collimator is a perfectly circular collimator whose diamter is 25 [mm].

Radiation Dose = Energy / (Unit Mass) the units are [MeV/Kg] also called [Gy]

I agree the independent variables are correlated. For sure there is some correlation between the Area and the Max and Min Diameters.

The goal of this project was to find out whether there is a dependence of the Field Size Factor (FSF) on the size and shape of the field and on the SOBP.

The size of the field is determined by the collimator area as the collimator does conform the radiation field. In short, only the protons that pass through the collimator aperture reach the patient. The other ones are stopped in the collimator brass thickness.

I have attached three other collimator shapes that we selected. Actually, I attached the approximated collimator aperture generated by my MatLab code which was necessary to incorporate these custom-made components in the Monte Carlo model of the synchrotron (machine that produces the proton beams). I cannot attach the true collimator pictures as they carry the patient names. It would be a privacy violation.

I used MatLab function 'regionprops' to extract the characteristic feature of the collimator aperture. I wrote a script to convert the returned area and Feret diameters form pixels into [mm]

I do not know any other feature that characterizes the area defined by a polygonal.

Any suggestion is very welcome.

Thank you.

Best regards

dpb on 21 Aug 2019

I didn't say you could make the coefficients physically meaningful, only that you can assign arbitrary units to them such that the coefficient times the independent variable(s) ends up with units of the response. That's almost self evident as the response is in a given set of units so the prediction is reproducing those whatever the terms in the correlation are.

I also didn't say it was going to be easy (or even necessarily, possible) to develop a correlation given the data you have.

I would wonder which of the observations goes with which of the representative collimator shapes? I'd like to be able to compare those to which observation they generated.

Of those, there are two basic shapes, one basically an ellipsoid while the others are what I'd call a kidney-like shape as was the picture you sent earlier. A collection of those images with their associated response would be interesting.

I don't yet know whether can find a model that would predict these results or not -- probably could if made it specific enough with respect to each individual case but I still wonder if such would ever be of any value as far as drawing conclusions from regarding basic relationships.

Is it possible to take measurements with theoretical shapes without actually using patients so one could start with defined geometric shapes and then bring in the eccentricity factors? If so, I think I'd try to start with such a designed experiment where made very defined changes in shapes that are computable and classifiable and see if making changes there would produce predictable results. Then one could perturb those idealized shapes into approximations or the real ones and see how the results were affected. Just a thought--"you can't control what isn't controlled" and happenstance variables are the bane of statistics and modelling.

Maura E. Monville on 21 Aug 2019

I think I did not explain myself about the lack of physical meaning of the polynomial fit.

The polynomial is:

p1*x^7 + p2*x^6 + p3*x^5 + p4*x^4 + p5*x^3 + p6*x^2 + p7*x + p8

where

x = MaxFDiam * MinFDiam + Area / MaxFDiam + Area / MinFDiam + MaxFDiam / MinFDiam

x cannot be assigned a units because

MaxFDiam * MinFDiam % has units [mm^2]

Area / MaxFDiam [mm] % has units [mm]

Area / MinFDiam [mm] % has units [mm]

MaxFDiam / MinFDiam [] % this term is unitless

what are the units of a sum of terms whose units are:

[mm^2] + [mm] + [mm] + [] = ???

The polynomial is a sum of powers of x ....

FSF (response variable) is unitless because it is the ratio of doses [Gy /Gy] = []

I agree on the need to relate the FSF to the proper observation.

I kept the order we folled when we measured the FSF. I agree tha to look for a relationship between the FSF and the collimator Area it should be better order the data in increasing values of Area. The same applies to each diameter.

The attached table2 shows all the FSF measurements with respect to each SOBP type and the relevant features of the collimator aperture.

Please, keep in mind that FSF is the radiation dose measured using the single collimator divided by the radiation dose measured using the Reference collimator.

As you can see there are two standard perfectly circular collimator. Namely,

15mm (whose diameter is 15 [mm])

6mm (whose diameer is 6 [mm]

All the other collimators are patient-specific so their shape is forged to reproduce the contour of the tumour.

The attached table1 shows the measurements of the SOBP

Range = Start + Depth

So a SOBP can also be characterized by its Range.

Thank you for all your insight.

Kind regards

dpb on 21 Aug 2019

1) The 6 mm is the outlier so it doesn't help. The 15 mm is right in the middle of the responses outside those that are the five or so that are the "peak" values. The curious thing would be to try to isolate why those are outstanding.

2) So the 11,12,13 do correlate with the same sequence in the original dataset? I'll have to study that some. Still think it would be worthwhile to line up the images with the response to observe side by side what shape yields what response that don't have enough data to do yet here.

3) A seventh-order poly with 14 (and really should just be 13) points is well over-fitted--the goodnes of a fit will be as much coming from the fact the solution is constrained so much as that the chosen model actually represents the functional form. As noted, it's still possible the other shapes haven't seen are different-enough that there could be a categorical variable to incorporate as grouping variable rather than quantitative. May not be, too, but I'd want to investigate further that direction.

I've not had the time to dig into the additional info in the pdf files as yet...have to go do some personal errands at the moment; maybe tonight could get back and look some more. It is an interesting problem and am just beginning to get enough to have a clue about it....

Maura E. Monville on 23 Aug 2019

Archive.zip

Done.

I have cleaned the only two files, out of 12, containing the patient's name.

Now you can easily upload the txt fieles with MatLab and plot each of them.

Thank you

dpb on 23 Aug 2019

Ah! OK...I only opened and looked at the first one and presumed all the rest were the same...if weren't same number of header lines then I guess will need to to it again. Mayhaps that's why some seemed to have gaps in the perimeters...

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Jon on 21 Aug 2019

Edited: dpb on 23 Aug 2019

Open in MATLAB Online

0 votes

In your example you find a fit to a function of one variable, and are somehow looking for a combination of terms to form that one variable. Do you need to get it into this form or is it ok to have the predicted value, y be a function of two variables?

Assuming the latter, in case it is helpful I just tried a somewhat simplistic approach of considering the response to be a quadratic function of the two inputs MinFDiam and MaxFDiam.

Regarding motivation for choosing this form, I guess you could consider this to be a low order taylor series representation. (one up from linear which I tried and didn't fit very well). I'm not sure of the precise mathematical statemement of this, but the general notion is that for small enough regions all continuous functions are well approximated by just the low order terms of the Taylor series, and in particular functions that show some curvature are well approximated by quadratics in a small enough region.

I am not familiar with using fitnlm so I just used fitlm as follows

x1 = MinFDiam(:) 
x2 = MaxFDiam(:)
y = Mk_Superf_FSF(:)
mdl = fitlm([x1 x2 x1.*x2 x1.^2 +x2.^2],y)

This gave the following statistics

mdl = 
Linear regression model:
    y ~ 1 + x1 + x2 + x3 + x4 + x5
Estimated Coefficients:
                    Estimate          SE         tStat        pValue  
                   ___________    __________    ________    __________
    (Intercept)        0.60854      0.075803       8.028    4.2585e-05
    x1                0.057919      0.025395      2.2808      0.052008
    x2               0.0015331      0.014779     0.10374       0.91993
    x3              0.00067369     0.0014413     0.46741       0.65268
    x4              -0.0024835     0.0012742     -1.9491      0.087118
    x5             -0.00031494    0.00071727    -0.43908       0.67222
Number of observations: 14, Error degrees of freedom: 8
Root Mean Squared Error: 0.0204
R-squared: 0.82,  Adjusted R-Squared: 0.708
F-statistic vs. constant model: 7.3, p-value = 0.00746

Which does not seem too bad.

16 Comments
Show 14 older comments Hide 14 older comments

Jon on 22 Aug 2019

Edited: Jon on 22 Aug 2019

Open in MATLAB Online

Looking at these p values we can see that the terms that include MaxFDiam have high p values which suggests that changes in MaxFDiam do not produce a large response in Mk_Superf_FSF. In other words it appears that your output is primarily driven by just the one variable MinFDiam.

Following up on that I tried fitting Mk_Superf_FSF = c2*MinFDiam^2 + c1*MinFDiam + c0

This had a similar R-squared as the previous fit above. Which is not surprising as the terms that involved MaxFDiam had not contributed much to the fit.

Continuing then to think of this as just fitting to the one variable MinFDiam, I looked at adding a cubic term

Mk_Superf_FSF = c3*MinFDiam^2 + c2*MinFDiam^2 + c1*MinFDiam + c0

as follows, which gave quite a high R-squared and low p values. So perhaps this is a useful model. Sorry when I copy and paste the MATLAB output it seems to wrap, but please run it yourself to see it better.

x1 = MinFDiam(:) 
x2 = MaxFDiam(:)
y = Mk_Superf_FSF(:)
mdl = fitlm([x1,x1.^2,x1.^3],y)

mdl =

Linear regression model:

y ~ 1 + x1 + x2 + x3

Estimated Coefficients:

Estimate SE tStat pValue

__________ __________ _______ __________

(Intercept) 0.11948 0.06442 1.8547 0.09333

x1 0.2005 0.017762 11.288 5.1814e-07

x2 -0.014639 0.0015451 -9.4745 2.6012e-06

x3 0.00034586 4.2364e-05 8.1642 9.8515e-06

Number of observations: 14, Error degrees of freedom: 10

Root Mean Squared Error: 0.00677

R-squared: 0.975, Adjusted R-Squared: 0.968

F-statistic vs. constant model: 131, p-value = 2.53e-08

Looking deeper into this, I plotted the resulting fit and original data to obtain.

Subjectively, this looks "overfit" to me.

I would suggest (as I think did @dpb) that you need to check if the response for MinFDiam=6 is an outlier. It clearly drives the fit below. If we eliminated that one point, it looks like the output doesn't even depend on MinFDiam.

In general, if possible, it is best to have some form of theoretical model that gives you an equation with some unknown coefficients. Then just use the regression to fit the unknown coefficients.

Maura E. Monville on 23 Aug 2019

Edited: dpb on 23 Aug 2019

Open in MATLAB Online

I confirm the 6mm is physically an outlier. It should not have been measured with the Markus ion chamber. That is stated in the report IAEA TRS-398 Code Of Practice.

Yesterday I uploaded the Table reporting all the measurements for all SOBPs. I also uploaded a zipped archive with the text files containing all the collimator aperture polygonal coordinates (x,y) for whoever wishes to see (plot) the collimator shapes.

The link between the Table and the polygonals is through the collimator identifier of form Axxxxxx (capital letter "A" followed by six integer positive digits).

I placed in a matrix the Area followed by the three FSF from the single SOBP and calculated the correlation coefficients. Surprisingly the FSF from the Deep_SOBP has the highest corelation with the Area. However the plot of the three FSF, versus the Area, shows FSF for the Superficial_SOBP has the best agreement.

Here is my code:

Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 159.29 201.80 220.96 271.45 282.36 327.04 277.54]; 
[AreaSrt, AreaInd] = sort(Area);
% SUPERFICIAL SOBP
Mk_Superf_FSF = [0.8672 1.0053 1.008 1.0142 1.0128 1.0142 1.0142 1.004 1.0062 1.0031 1.0018 1.0022 1.0022 1.0018];
Mk_Superf_FSFSrt = Mk_Superf_FSF(AreaInd);
R = corrcoef(AreaSrt', Mk_Superf_FSFSrt')
% INTERMEDIATE SOBP
Mk_Inter_FSF = [0.86664 1.0028 1.0098 1.0098 1.0123 1.0109 1.0102 1.0049 1.0042 1.0018 1.0007 1.0007 1 1.0014];                
Mk_Inter_FSFSrt = Mk_Inter_FSF(AreaInd);
R = corrcoef(AreaSrt',Mk_Inter_FSFSrt')
% DEEP SOBP
Mk_Deep_FSF = [0.84279 0.99719 0.99264 1.0003 1.0008 1.0001 0.99946  0.99913 1.0003 0.9993 0.99962 1.0001 0.99995  1.0004];                              
Mk_Deep_FSFSrt = Mk_Deep_FSF(AreaInd);
R = corrcoef(AreaSrt',Mk_Deep_FSFSrt')
% Place all vectors in matrix
M = [AreaSrt' Mk_Superf_FSFSrt' Mk_Inter_FSFSrt' Mk_Deep_FSFSrt'];
% COMPUTE MATRIX CORRELATION COEFFICIENTS
Rnew = corrcoef(M)

I get the following matrix of correlation coefficients showing that the highest correlation is between Area and the FSF for Deep_SOBP:

>> Rnew = corrcoef(M)
Rnew =
            1      0.36357      0.36585      0.48136
      0.36357            1      0.99896      0.99037
      0.36585      0.99896            1      0.99095
      0.48136      0.99037      0.99095            1

dpb on 24 Aug 2019

Edited: dpb on 25 Aug 2019

"I think the[r]e is a physics explanation for the higher measurements."

And well may be but I think it highly unlikely that explanation is in the variables controlled/measured here.(*)

The MC simulation misses those specific points by far more than the others in a consistent direction so whatever it is isn't included in that model, either.

(*) And note that even if you were successful at building a model by some magic transformation of variables or nonlinear curve-fitting strategem that did manage to fit the observations from these measurements that to infer that would be the physical reason behind the values would be a gross misrepresentation of such a fit even if you could make it happen with a set of coefficients with consistent units.

Maura E. Monville on 9 Sep 2019

We sort of come up with a physical explanation.

The reason for the dose measured at the center of the SOBP to be the highest for the Superficial SOBP, decresing for the Intermediate SOBP, and the lowest for the Deep SOBP is clearly beam attenuation due to the center of the SOBP being located deeper and deeper in the eye.

The asymptotic trend towards 1 of the measurement as the colimator area grows bigger is possibly due to scttered radiation not reaching the SOBP center, so not contributing to the measured dose, since it gets more spread laterally as the collimator aperture grows bigger.

Noteworthy is that the energies usd for ocular treatment are very low. Scttered radiation produced by the collimator has of course even lower energy.

I would like to thank everyone who has taken the time to look into my problem.

Thank you.

Sincerely,

Maura E. M.

Sign in to comment.

nonlinear fit of experimental data

14 Comments
Show 12 older comments Hide 12 older comments

Answers (1)

16 Comments
Show 14 older comments Hide 14 older comments

Categories

Products

Release

Tags

Community Treasure Hunt

nonlinear fit of experimental data

14 Comments Show 12 older comments Hide 12 older comments

Answers (1)

16 Comments Show 14 older comments Hide 14 older comments

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

14 Comments
Show 12 older comments Hide 12 older comments

16 Comments
Show 14 older comments Hide 14 older comments