question about determining the correct number of clusters (documentation)
Show older comments
I have a question about determining the correct number of clusters (in k-means clustering).
In the documentation, there is a section about 'determining the correct number of clusters'. Please help me understand what the arguments in the table are for: iter (which would mean iterations), phase (?), num (?), and sum(?).
In the example, you will see:
Best total sum of distances = 1771.1
Here are my questions:
- Are we going for the best total sum of distances?
- From here, how are we able to determine that 4 is indeed the correct?
Answers (1)
Bernhard Suhm
on 3 Oct 2018
0 votes
You can use the silhouette plot or evalclusters function to evaluate the quality of your clustering. There's a little more info in this previous answer.
Categories
Find more on Cluster Analysis and Anomaly Detection in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!