training alexnet from scratch (i.e. reset weight)

Question

Andrea Apiella on 21 Nov 2017

1
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/368387-training-alexnet-from-scratch-i-e-reset-weight

Answered: Amir Ebrahimi on 1 Nov 2019

I would train an alexnet DNN given by matlab function

alexnet

from scratch (i.e. without pretraining on ImageNet given by alexnet function). I could to manually set weights but I don't know the from what distribution I can sample my initial weights. Is there a built-in matlab option that make it for me? e.g. I read that python's library has the option pre-training=off but I don't find a similar option in matlab.

1 Comment
Show -1 older commentsHide -1 older comments

Ariel Avshalumov on 8 Aug 2018

Maybe a Gaussian white noise distribution would work for you? I also have the same problem. Let me know if you find something relevant!

Sign in to comment.

Sign in to answer this question.

Answer 1

Ariel Avshalumov on 16 Aug 2018

2
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/368387-training-alexnet-from-scratch-i-e-reset-weight#answer_333003

Edited: Ariel Avshalumov on 16 Aug 2018

Open in MATLAB Online

This might be what you are looking for. My friend discovered this when he wanted to do a similar thing with a CNN.

net = alexnet;
net.Layers

Once you see the layers pick all the convolution layers and fully connected layers and remember their position (eg. layer 6 is convolution or layer 20 is fully connected etc.)

Then all you need to do is to use this singe line of code for each layer that you want to change:

layers(X).Weights = randn([x y z t]) * 0.01;

Where X is the position number of the layer (layer 6 or 2 etc) and

[x y z t] is [FilterSize(1) FilterSize(2) NumChannels NumFilters]

You can find all of these values by writing a little bit of code which pulls information about the layers in the net or you can manually look at each layer by clicking on net in the workspace and then clicking the layer that you want. This information will be a set of variables that are tied with that layer.

The variables could also be different if you look at different layers. For example the Fully Connected layers have the property called Weights which is a 2 dimensional matrix so all you need to change in that case is this property.

layers(X).Weights = randn([Weights(1) Weights(2)]) * 0.01;

This might be more 'manual' than you prefer but it probably does what you need. If you want you can probably use the white Gaussian noise distribution instead of creating your own random distribution but I think that both the way I showed and the white noise distribution produce the same effect.