How do I find slack variables in SVM?/ Distance to the boundary?
Show older comments
Hi, I used "svmtrain" to train the algorithm: svmStruct=svmtrain(xdata,group); I used "svmclassify" to classify.
My data is not perfectly linearly separable but I still used a linear classifier. In theory, allowances are made for by a slack variable. (Soft margin) I refer to the toolbox help where the theory is. It mentions the slack variable and 2 ways it is computed. http://www.mathworks.com/help/bioinfo/ug/support-vector-machines-svm.html
My issue is that, svmStruct does not save the slack variable. Neither can I find it in the function to recall it and save it.
If not, how can I find the distance from each data point to the boundary?
Can anyone help me with this? Thanks
Answers (2)
the cyclist
on 8 Feb 2013
0 votes
It is the input parameter 'boxconstraint' to the svmtrain() command. The default value is 1.
5 Comments
Mech Princess
on 8 Feb 2013
Edited: Mech Princess
on 8 Feb 2013
the cyclist
on 8 Feb 2013
What values? 'boxconstraint' is an input parameter. Why would it need to be reproduced in the output?
Mech Princess
on 8 Feb 2013
Edited: Mech Princess
on 8 Feb 2013
the cyclist
on 8 Feb 2013
Sorry! I thought the slack variable was the parameter C (which is actually the "penalty parameter").
I am not 100% sure that the slack variables have to be explicitly calculated to solve for the support vectors. (It's been a long time since I have used these techniques, and I actually don't have the toolbox at this time, to check.)
Sorry to have been more a distraction than a solution!
Mech Princess
on 9 Feb 2013
Ilya
on 10 Feb 2013
0 votes
By definition, a slack variable for observation x with label y (-1 or +1) is max(0,1-y*f), where f is the SVM prediction (soft score ranging from -inf to +inf). svmclassify does not return the scores, so you need to compute the SVM scores yourself. Start with the definition of the SVM model, compute kernel products, multiply by the alpha coefficients and add the bias term. It is easier than it sounds.
6 Comments
Mech Princess
on 11 Feb 2013
Edited: Mech Princess
on 11 Feb 2013
Ilya
on 11 Feb 2013
What you posted does not make much sense.
alpha is undefined.
If alpha is n-by-1, y is n-by-1 and x is n-by-p, alpha*y*x gives an error.
There is no such thing as "slack variable" for test data. Slack is used for solving the SVM problem and makes sense for training data only.
Y(w*X-b) is not a valid MATLAB expression.
Mech Princess
on 11 Feb 2013
Ilya
on 11 Feb 2013
Here is how you compute SVM scores for the new data in Xnew and SVM model saved in svm_struct:
sv = svm_struct.SupportVectors;
alphaHat = svm_struct.Alpha;
bias = svm_struct.Bias;
kfun = svm_struct.KernelFunction;
kfunargs = svm_struct.KernelFunctionArgs;
f = kfun(sv,Xnew,kfunargs{:})'*alphaHat(:) + bias;
The distance from the boundary depends on what you mean by "boundary". The decision boundary is defined by f=0, and the signed distance is then f. The support hyperplanes are defined by y*f=1, and the signed distance is y*f-1.
Mark
on 19 Aug 2014
Thanks!
Only one remark: I think that this works fine if 'autoscale' is set to false (in the svmtrain function). If the data is scaled, you should also scale Xnew before you feed it to the kernel function:
shift = svm_struct.ScaleData.shift;
scale = svm_struct.ScaleData.scaleFactor;
Now you can scale Xnew:
XnewScaled = ( Xnew - shift ) .* scale;
and then use XnewScale in the kernel function as above:
f = kfun( sv, XnewScaled, kfunargs{:} )' * alphaHat + bias
Aliza Rubenstein
on 19 Jan 2017
Thanks for this information. I think it should be Xnew + shift, not Xnew - shift. shift is the negative of the mean.
Categories
Find more on MATLAB in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!