K-mean Clustering

Question

0 votes

Hi everyone, can someone help me with how to use k-means clustering, or perhaps share suitable code for clustering wind speed data? I have wind speed data in the form of latitude, longitude, and wind speed. I want to cluster the data into 3 groups.


Answer 1

Image Analyst on 12 Nov 2021

0 votes

If you have all the lat and lon values, then just put each into kmeans separately:

numColumns = 26; % Or however many columns you know there to be.
[xIndexes, xCentroids] = kmeans(lon, numColumns);
numRows = 50; % Or however many rows you know there to be.
[yIndexes, yCentroids] = kmeans(lat, numRows);

The values of the columns (x or longitude values) will be in xCentroids.

The values of the rows (y or lat values) will be in yCentroids.
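
A minimal sketch of this idea on synthetic data, assuming the points sit on a slightly jittered regular grid. The grid size, jitter, and the names sortedLon and sortedLat are made up for illustration, and kmeans needs the Statistics and Machine Learning Toolbox:

```matlab
% Synthetic stand-in data: a 5-by-4 grid of stations with small jitter.
rng(0);                                   % reproducible random numbers
[lonGrid, latGrid] = meshgrid(100:103, 1:5);
lon = lonGrid(:) + 0.01 * randn(numel(lonGrid), 1);
lat = latGrid(:) + 0.01 * randn(numel(latGrid), 1);

numColumns = 4;   % number of distinct longitudes (assumed known)
numRows = 5;      % number of distinct latitudes (assumed known)

% Cluster each coordinate on its own. Replicates reduce the chance of
% ending up in a poor local optimum.
[xIndexes, xCentroids] = kmeans(lon, numColumns, 'Replicates', 10);
[yIndexes, yCentroids] = kmeans(lat, numRows, 'Replicates', 10);

% Centroids come back in arbitrary order. Sort copies of them to read off
% the grid lines, leaving xCentroids/yCentroids aligned with the indexes.
sortedLon = sort(xCentroids);
sortedLat = sort(yCentroids);
disp(sortedLon.');
disp(sortedLat.');
```

On real data the number of rows and columns has to be known or estimated first, since kmeans will not discover it.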


Image Analyst on 14 Nov 2021

WIND_26YEARS.csv

It doesn't make sense. Why is Y1 random? And why is Y1 a row vector while X1 is a column vector? Even if Y1 was also a column vector, it doesn't make sense to cluster random data.

And where is k in your kmeans() call? You read in the badly-named "k" but don't even consider it when you're doing kmeans? Did you realize you're calling kmeans without your data???

I would have fixed it for you but I realized I don't know what each row of k represents.

% Demo by Image Analyst
clc;    % Clear the command window.
close all;  % Close all figures (except those of imtool.)
clear;  % Erase all existing variables. Or clearvars if you want.
workspace;  % Make sure the workspace panel is showing.
format long g;
format compact;
fontSize = 22;
% Read in data
k = readmatrix('WIND_26YEARS.csv');
% Plot raw data
subplot(3, 1, 1);
plot(k, 'b-')
grid on;
xlabel('index', 'FontSize',fontSize);
ylabel('Value of k', 'FontSize',fontSize)
title('All the k Values', 'FontSize',fontSize)
% Plot histogram of k data.
subplot(3, 1, 2);
histogram(k);
grid on;
xlabel('k', 'FontSize',fontSize);
ylabel('Count', 'FontSize',fontSize)
title('Distribution of k.  Note no clusters!', 'FontSize',fontSize)
% Original poster's (bad) code below:
subplot(3, 1, 3);
X1=(1:6943)';
Y1=randn(6943,1);
numClusters=3;
idx1=kmeans([X1, Y1],numClusters,'Replicates',5);
pointclust=repmat(idx1,1,numClusters)==repmat(1:numClusters,numel(idx1),1);
colors=hsv(numClusters);
for j=1:numClusters
    plot(X1(pointclust(:,j)),Y1(pointclust(:,j)),'Color',colors(j,:));
    if j==1
        hold on;
    end
end
hold off;
xlabel('X1', 'FontSize',fontSize);
ylabel('Y1', 'FontSize',fontSize)
title('Clusters are in different colors', 'FontSize',fontSize)
grid on;
g = gcf;
g.WindowState = 'maximized';
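
As a side note on the pointclust line above: in R2016b and later, implicit expansion should give the same logical mask without repmat. A small sketch, where idx1 is a made-up stand-in for the labels kmeans would return and the two mask names are only for illustration:

```matlab
numClusters = 3;
idx1 = [1; 3; 2; 2; 1; 3];   % stand-in cluster labels; kmeans would normally supply these

% repmat version, as in the code above
maskRepmat = repmat(idx1, 1, numClusters) == repmat(1:numClusters, numel(idx1), 1);

% Implicit expansion: column vector compared against a row vector
maskImplicit = (idx1 == 1:numClusters);

% Column j of either mask is true for the points assigned to cluster j.
disp(isequal(maskRepmat, maskImplicit));
```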

Image Analyst on 17 Nov 2021

WIND_26YEARS.csv

I think you uploaded a different data file than you think. Look what happens when I run this code:

% Read in data
k = readmatrix('WIND_26YEARS.csv');
lats = k(1:3:end); % 2315 long
lons = k(2:3:end); % 2314 long
speeds = k(3:3:end); % 2314 long
whos k
whos lats
whos lons
whos speeds

  Name         Size            Bytes  Class     Attributes

  k         6943x1             55544  double
  lats      2315x1             18520  double
  lons      2314x1             18512  double
  speeds    2314x1             18512  double

As you can see, the length of k (6943) is not a multiple of 3, so lats is one element longer than the other two. Why is that?

Moreover, the wind speeds are practically the same value as lats and lons (they are all around values 0-8), which is suspicious unless you measured the wind near the north pole. Please attach the actual data.
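
The length mismatch can be checked directly. A small sketch, using a stand-in vector of the same length as k instead of the real file, and assuming the values really are interleaved as lat, lon, speed (the names numExtra, kTrimmed, and triples are made up here):

```matlab
k = (1:6943).';                 % stand-in for the 6943-by-1 vector read from the file

numExtra = mod(numel(k), 3);    % leftover elements that do not form a full triple
fprintf('%d element(s) left over\n', numExtra);

% Drop the leftover element(s) at the end, then put one triple on each row.
kTrimmed = k(1:end - numExtra);
triples = reshape(kTrimmed, 3, []).';   % columns: lat, lon, speed (assumed order)

lats = triples(:, 1);
lons = triples(:, 2);
speeds = triples(:, 3);
disp(size(triples));
```

Whether dropping the trailing value is the right fix depends on why the file has an extra element in the first place.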

Image Analyst on 18 Nov 2021

So can we just take the first 2314 values and ignore the extra lat?

MAT NIZAM UTI on 18 Nov 2021

Edited: MAT NIZAM UTI on 18 Nov 2021

Sure. Well, I don't really know how MATLAB works, because after comparing the actual values with the values read in, both lons and speeds were different from the actual data.

https://drive.google.com/drive/folders/1tFOl0ZHQo4XzB-VGvi-LBLPg_lERG98u (this is my actual data). Columns C through LB contain the wind speed values.

Answer 2

H R on 9 Nov 2021

1 vote

Please see: https://www.mathworks.com/help/stats/kmeans.html.

If your data is in a matrix format X, then you can use the following:

[idx,C] = kmeans(X,3,'Distance','cityblock','Replicates',5);
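
For data given as latitude, longitude, and wind speed, one way to build X might look like the sketch below. The data here is synthetic, the zscore standardization is an assumption and not something the question asks for, and whether lat and lon belong in X at all depends on what the clusters are meant to capture:

```matlab
% Synthetic stand-in data (not the poster's file): 300 points, three speed regimes.
rng(1);
n = 300;
lat = 1 + 6 * rand(n, 1);
lon = 100 + 19 * rand(n, 1);
speed = [2 + 0.3 * randn(100, 1); 5 + 0.3 * randn(100, 1); 8 + 0.3 * randn(100, 1)];

% One row per observation, one column per variable. zscore puts the columns
% on a comparable scale so no single variable dominates the distance.
X = zscore([lat, lon, speed]);

[idx, C] = kmeans(X, 3, 'Distance', 'cityblock', 'Replicates', 5);

% Number of points assigned to each of the 3 clusters.
counts = accumarray(idx, 1);
disp(counts.');
```

To cluster on wind speed alone, pass X = speed instead. kmeans accepts a single column.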


H R on 12 Nov 2021

Yes, everything is possible (even using 1-D data), but in the end you have to decide what you are looking for from the clustering task and check whether the outcome makes sense to you.

MAT NIZAM UTI on 14 Nov 2021

Edited: Image Analyst on 14 Nov 2021

WIND_26YEARS.csv

Here is my code, and I get an error from it:

Error using horzcat

Dimensions of matrices being concatenated are not consistent.

Error in k_mean (line 7)

idx1=kmeans([X1 Y1],numClusters,'Replicates',5);

This is the code:

k = xlsread('WIND_26YEARS.csv');
X1=(1:6943);
Y1=randn(6943,1); 
numClusters=3;
idx1=kmeans([X1 Y1],numClusters,'Replicates',5);
pointclust=repmat(idx1,1,numClusters)==repmat(1:numClusters,numel(idx1),1);
colors=hsv(numClusters);  
for j=1:numClusters,
    plot(X1(pointclust(:,j)),Y1(pointclust(:,j)),'Color',colors(j,:));
    if j==1,
        hold on;
    end;
end,
hold off;
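
The horzcat error above comes from the shapes of X1 and Y1: (1:6943) is a row vector while randn(6943,1) is a column vector. A minimal sketch of the mismatch and one possible fix (the variable name data is made up, and the kmeans call is left out so this runs without any toolbox):

```matlab
X1 = (1:6943);         % 1-by-6943 row vector, as in the code above
Y1 = randn(6943, 1);   % 6943-by-1 column vector

disp(size(X1));
disp(size(Y1));

% [X1 Y1] fails because the two pieces have different numbers of rows.
% Forcing both to columns with (:) gives a 6943-by-2 matrix, which is the
% one-row-per-observation layout that kmeans expects.
data = [X1(:), Y1(:)];
disp(size(data));
```

This only fixes the dimension error. It does not change the fact that Y1 is random and the file contents in k are never used.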
