How to classify images depending on the shape of each image's object?

Mohamed Elbeialy

21 Apr 2021

3 Answers

Updated 5 Jun 2021

0 votes

6 Comments

Mohamed Elbeialy on 24 Apr 2021

There are 5 classes: cars, flowers, buildings, trees, and dogs. The point is to classify them according to the shape of the object inside each image. Using the max value of the image is just a hint. If you have another way to determine the shape of the object, use it.

Amit on 5 Jun 2021

Follow these steps:

First, binarize the image and find its edges using Canny edge detection.
Then use regionprops to extract the isolated regions in the image.
Find the centroid of each region.
Then, at various angles, measure the distance between the region's centroid and the points on its edge.
These distances make up your shape descriptor.
Compare this descriptor with the descriptors of known shapes to categorize your query (unknown) object.

This should work for you.
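
The steps above can be sketched roughly as follows. This is a minimal illustration, not the method from the paper mentioned in this comment: it assumes the Image Processing Toolbox, uses a synthetic disk in place of a real binarized image, and traces the outline with bwboundaries rather than Canny edges. The names nAngles, sig, and refSig are made up for the example, and the descriptor is not rotation invariant as written.

```matlab
% Synthetic stand-in for a binarized object: a filled disk of radius 60.
[X, Y] = meshgrid(1:200, 1:200);
bw = (X - 100).^2 + (Y - 100).^2 <= 60^2;

% Keep only the largest region so there is exactly one boundary.
bw = bwareafilt(bw, 1);

% Centroid of the region, returned as [x y].
stats = regionprops(bw, 'Centroid');
c = stats(1).Centroid;

% Boundary pixels of the region as [row col] pairs.
B = bwboundaries(bw, 'noholes');
b = B{1};

% Angle and distance of every boundary pixel relative to the centroid.
dx = b(:, 2) - c(1);
dy = b(:, 1) - c(2);
theta = atan2(dy, dx);
r = hypot(dx, dy);

% Sample the boundary in nAngles angular bins (an arbitrary choice),
% keeping the farthest boundary point in each bin.
nAngles = 36;
edges = linspace(-pi, pi, nAngles + 1);
bin = discretize(theta, edges);
sig = accumarray(bin, r, [nAngles 1], @max);

% Normalize by the largest distance so the descriptor ignores object size.
sig = sig / max(sig);

% Compare with the descriptor of a known shape (here an ideal circle).
refSig = ones(nAngles, 1);
d = norm(sig - refSig);   % small d means the shapes are similar
```

For a real image, bw would come from your own binarization step, and refSig would be a signature computed the same way from a labeled example of each class.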

I have published an IEEE paper on the above process. You can send a request to me at amit.kenjale@gmail.com, and I will send you the paper, which explains this process in detail with images showing intermediate results.

Answers (3)

Mahesh Taparia on 24 Apr 2021

1 vote

There is already an existing answer to a similar problem. You can refer to this link for that.

1 Comment

Mohamed Elbeialy on 24 Apr 2021

This link does not have any relation with my question.

Image Analyst on 24 Apr 2021

0 votes

Try the transfer learning example with CNN/AlexNet. There should be demos in the Deep Learning Toolbox.

30 Comments

Mohamed Elbeialy on 25 Apr 2021

This is the code I built. I want to edit it so that it classifies images by the shape of the object in each one. I am not looking for a specific shape such as a triangle, circle, or square, but for the shape of the object inside each image, which can be any random shape.

imds = imageDatastore('images','IncludeSubfolders',true,'LabelSource','foldernames');
[imdsValidation,imdsTrain1,imdstest,imdsTrain2] = splitEachLabel(imds,0.1,0.1,0.8)
layers = [
    imageInputLayer([227 227 3],"Name","data")
    convolution2dLayer([11 11],94,"Name","conv1","BiasLearnRateFactor",2,"Stride",[4 4])
    reluLayer("Name","relu1")
    crossChannelNormalizationLayer(5,"Name","norm1","K",1)
    maxPooling2dLayer([3 3],"Name","pool2","Stride",[2 2])
    convolution2dLayer([3 3],384,"Name","conv3","BiasLearnRateFactor",2,"Padding",[1 1 1 1])
   fullyConnectedLayer(5,"Name","new fc","BiasLearnRateFactor",10,"WeightLearnRateFactor",10)
    softmaxLayer("Name","prob")
    classificationLayer("Name","classoutput")];
miniBatchSize = 25;
valFrequency = floor(numel(augimdsTrain.Files)/miniBatchSize);
options = trainingOptions('sgdm', ...
    'MiniBatchSize',25, ...
    'MaxEpochs',8, ...
    'ValidationData',augimdsValidation, ...
    'ValidationFrequency',valFrequency, ...
    'ValidationPatience',4
    'Plots','training-progress');
trainedNet = trainNetwork(augimdsTrain,layers,options);  % train the network 
[YPred,probs] = classify(trainedNet,augimdsValidation);   % classify the validation images
fracCorrect = accuracy/numel(YPred)

Walter Roberson on 25 Apr 2021

imds = imageDatastore('images','IncludeSubfolders',true,'LabelSource','foldernames');

Okay, you have an image data store named imds

[imdsValidation,imdsTrain1,imdstest,imdsTrain2] = splitEachLabel(imds,0.1,0.1,0.8)

Okay, you have four more image data store objects whose names begin with imds

valFrequency = floor(numel(augimdsTrain.Files)/miniBatchSize);

augimdsTrain is not defined. The variable name is consistent with an augmented image data store, not with an image data store.

options = trainingOptions('sgdm', ...
    'MiniBatchSize',25, ...
    'MaxEpochs',8, ...
    'ValidationData',augimdsValidation, ...
    'ValidationFrequency',valFrequency, ...
    'ValidationPatience',4
   'Plots','training-progress');

augimdsValidation is not defined. The variable name is consistent with an augmented image data store, not with an image data store.

valFrequency is not defined either, because the line that assigns it fails on the undefined augimdsTrain.

You are missing a line continuation on the ValidationPatience line.

trainedNet = trainNetwork(augimdsTrain,layers,options);  % train the network 
[YPred,probs] = classify(trainedNet,augimdsValidation);   % classify the validation images

augimdsTrain and augimdsValidation are not defined.
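
One way the issues listed above might be addressed is sketched below. It is only an illustration under several assumptions: the Deep Learning Toolbox is installed, a throwaway folder of random images stands in for the real 'images' folder, and the 80/10/10 split and the names imdsTrain, imdsTest, and inputSize are choices made for the example rather than anything stated in the thread. The training calls are left commented out because they need the layers array from the question and real data.

```matlab
% Throwaway two-class dataset so the sketch runs without the original images
% (folder and class names are hypothetical).
root = fullfile(tempdir, 'shape_split_demo');
classNames = {'cars', 'dogs'};
for c = 1:numel(classNames)
    classDir = fullfile(root, classNames{c});
    if ~exist(classDir, 'dir'), mkdir(classDir); end
    for k = 1:10
        imwrite(uint8(255 * rand(32, 32, 3)), ...
            fullfile(classDir, sprintf('img%02d.png', k)));
    end
end

imds = imageDatastore(root, 'IncludeSubfolders', true, 'LabelSource', 'foldernames');

% 80% training, 10% validation, and the remaining 10% for testing.
[imdsTrain, imdsValidation, imdsTest] = splitEachLabel(imds, 0.8, 0.1, 'randomized');

% Define the augmented datastores that the later lines refer to.
inputSize = [227 227];
augimdsTrain = augmentedImageDatastore(inputSize, imdsTrain);
augimdsValidation = augmentedImageDatastore(inputSize, imdsValidation);

miniBatchSize = 25;
% Guard against a frequency of 0 when there are fewer images than one batch.
valFrequency = max(1, floor(numel(imdsTrain.Files) / miniBatchSize));

% Note the line continuation after 'ValidationPatience',4.
options = trainingOptions('sgdm', ...
    'MiniBatchSize', miniBatchSize, ...
    'MaxEpochs', 8, ...
    'ValidationData', augimdsValidation, ...
    'ValidationFrequency', valFrequency, ...
    'ValidationPatience', 4, ...
    'Plots', 'training-progress');

% Training and evaluation, left commented because they need the `layers`
% array from the question and real images:
% trainedNet = trainNetwork(augimdsTrain, layers, options);
% YPred = classify(trainedNet, augimdsValidation);
% fracCorrect = mean(YPred == imdsValidation.Labels)
```

The last commented line computes the fraction of correct predictions directly from the labels, which avoids relying on an accuracy variable that is never assigned in the posted code.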

Mohamed Elbeialy on 27 Apr 2021

Here it is. How can I make sure that the network will detect the random shapes of all the images inside the imageDatastore?

Image Analyst on 27 Apr 2021

To get a binary image mask of inside where you traced the outline:

binaryImage = grayImage < someValue;

You'd have to train (label) your training images all with the outline and with the class you know them to be (car, dog, etc.)

Walter Roberson on 25 Apr 2021

0 votes

imageInputLayer([227 227 3],"Name","data","Normalization","rescale-zero-one")

This will rescale each input image to have a maximum value of 1.

20 Comments

Walter Roberson on 26 Apr 2021

When you create an image input layer and specify 'Normalization', 'rescale-zero-one' but no 'Min' or 'Max' value, then by default the code does the operation

minval = double( min(ThisImage(:)) );
maxval = double( max(ThisImage(:)) );
ScaledImage = (double(ThisImage) - minval) ./ (maxval - minval);

The outcome of this is that the maximum value of ScaledImage will be 1 and the minimum will be 0. The data will be rescaled from whatever range it happens to be in. For example, if the image happened to be grayscale uint8 data in the range 10 to 62, then you would subtract 10 from each location and divide the result by (62-10 = 52). This would map the 10 values to 0 and the 62 values to 1, and everything in between would be linearly scaled, so for example (36 - 10)/(62-10) -> 1/2

In other words, it makes the value of the maximum pixel 1, and all other values will be less than 1.
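
The arithmetic in the 10-to-62 example can be checked on a tiny array. This is only a sketch of the formula given above applied to three hand-picked values; it does not demonstrate how imageInputLayer itself gathers its minimum and maximum.

```matlab
% Three sample pixel values covering the example range 10..62.
ThisImage = uint8([10 36 62]);

minval = double(min(ThisImage(:)));   % 10
maxval = double(max(ThisImage(:)));   % 62

% Linear rescale: 10 -> 0, 36 -> 0.5, 62 -> 1.
ScaledImage = (double(ThisImage) - minval) ./ (maxval - minval);
disp(ScaledImage)
```

Note that a constant image (maxval equal to minval) would divide by zero here, so real code would need to handle that case separately.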

The difference between what this does (which you said earlier you do not want to do) and what you want to do is... difficult for us to understand. Perhaps you only want to look at a subset of the image and scale that subset, that you want to detect a "shape" first with the shape not including all of the image, and then you want to find the maximum value.

Or perhaps you see some difference between the values of the pixels as passed in by reading the image, as being different than "brightness" ??

It is hypothetically possible that even though a particular pixel might have the maximum uint8 value of 255 (maximum brightness), that you have a non-linear mapping between pixel value and the value you want to scale against. For example it is hypothetically possible that the intensities at the pixels represent the distance from the "center", with the "center" value being near 128, and that what you are looking for as the "maximum" might mean "closest to the center value" instead of maximum displacement that displays as "brightest". We would tend to think that you would have already told us if that was the case, but since you have not told us otherwise, and have told us that you don't mean "brightness", then we cannot rule it out.

Mohamed Elbeialy on 27 Apr 2021

Here it is. However, I am stuck with gTruth, which does not let me include all of the imageDatastore images: [imds,blds] = objectDetectorTrainingData(gTruth)

Image Analyst on 27 Apr 2021

You asked: "I do find the code for detecting object inside image, but I do not know how to apply to the whole imageDatastore." So, try this:

ds = imageDatastore('*.png')
numFiles = numel(ds.Files)
% Apply "code for detecting object inside image" to "the whole
% imageDatastore" - apply to every image in the image datastore.
for k = 1 : numFiles
	thisFullFileName = ds.Files{k};
	fprintf('Analyzing #%d of %d : "%s" ...\n', k, numFiles, thisFullFileName);
	theImage = imread(thisFullFileName);
	imshow(theImage);
	[folder, baseFileNameNoExt, ext] = fileparts(thisFullFileName);
	title(baseFileNameNoExt, 'Interpreter', 'none');
	drawnow;
	% Now give code to do something to analyze theImage...
	% Put your existing code "for detecting object inside image" here:
end
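
As an illustration of what could go inside such a loop, the sketch below stores one measurement per image in a table. Everything specific in it is an assumption made for the example: the demo folder, the two synthetic images, the fixed threshold of 128 standing in for the "code for detecting object inside image", and the names objectArea and results.

```matlab
% Hypothetical demo folder with two synthetic grayscale images.
demoDir = fullfile(tempdir, 'ds_loop_demo');
if ~exist(demoDir, 'dir'), mkdir(demoDir); end
img1 = zeros(64, 64, 'uint8');
img1(20:40, 10:50) = 200;             % bright rectangle, 21 x 41 pixels
imwrite(img1, fullfile(demoDir, 'rect.png'));
img2 = zeros(64, 64, 'uint8');
img2(5:10, 5:10) = 200;               % bright square, 6 x 6 pixels
imwrite(img2, fullfile(demoDir, 'square.png'));

ds = imageDatastore(fullfile(demoDir, '*.png'));
numFiles = numel(ds.Files);

% One result per image in the datastore.
objectArea = zeros(numFiles, 1);
for k = 1 : numFiles
    theImage = readimage(ds, k);
    % Placeholder detection: a fixed threshold separates object from background.
    bw = theImage > 128;
    objectArea(k) = nnz(bw);          % number of object pixels
end

% Collect file names and measurements side by side.
results = table(ds.Files, objectArea, 'VariableNames', {'File', 'AreaPixels'});
disp(results)
```

Replacing the thresholding lines with your own detection code, and objectArea with whatever descriptor you compute, gives one row per image for the whole datastore.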

Products

MATLAB

Release

R2020a
