
Training error prediction higher than Training error in training process

Is it correct to expect that the training error at the end of the training process and the training error obtained when predicting with the fixed (trained) network are the same? I am currently using an RNN which, during training, reports an error of 3% on the training set, but when I use the trained network to predict the values of the training set I get an error of 18% (I am using it for prediction, not classification). The error on the validation set is the same in both cases. Is there any finalization applied to the network after the last training step which might lead to this result?

Answers (1)

Avadhoot on 21 Feb 2024
Hi Claudia,
I understand from your question that you are getting a very low error rate on your training samples during training but a high error rate at prediction time, while the validation error rate remains the same.
This looks like a classic example of overfitting, where the model performs extremely well on the training set but fails to generalize to unseen data. You can try the following remedies:
  1. Regularization: Add L2 regularization to your loss function to reduce overfitting; penalizing large weights encourages the model to generalize.
  2. Early stopping: Monitor the performance of the model on a validation set and stop training when performance begins to degrade, indicating overfitting. This can be specified in the training options in MATLAB.
  3. Data augmentation: Perform data augmentation on your training dataset to create new samples from existing ones, for example by adding noise, applying temporal distortions, or using techniques like back-translation for text data. More varied training data helps the model generalize. You can find data augmentation options in MATLAB datastores.
  4. Reduce model complexity: Try reducing the number of layers or hidden units so the network has less capacity to memorize the training data.
  5. Weight initialization: Initialize weights using an initialization scheme like Xavier or He initialization.
  6. Hyperparameter optimization: Find better values of hyperparameters in your model by performing grid search or random search on the hyperparameters.
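As a minimal sketch, remedies 1 and 2 above can both be configured through "trainingOptions". The variable names ("XTrain", "YTrain", "XVal", "YVal", "layers") and the specific hyperparameter values here are placeholders you would replace with your own:

```matlab
% Sketch: L2 regularization (remedy 1) and early stopping (remedy 2)
% configured via trainingOptions. Values shown are illustrative only.
options = trainingOptions('adam', ...
    'MaxEpochs', 100, ...
    'L2Regularization', 1e-4, ...      % remedy 1: weight-decay penalty
    'ValidationData', {XVal, YVal}, ... % monitored for early stopping
    'ValidationFrequency', 20, ...
    'ValidationPatience', 5, ...        % remedy 2: stop after 5 stalls
    'Shuffle', 'every-epoch', ...
    'Plots', 'training-progress');

% net = trainNetwork(XTrain, YTrain, layers, options);
```

With 'ValidationPatience' set, training stops automatically once the validation loss has failed to improve for the given number of validation evaluations, which is the early-stopping behavior described in point 2.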
For more information about data augmentation and early stopping, refer to the following documentation:
  1. https://www.mathworks.com/help/deeplearning/ref/trainingoptions.html
  2. https://www.mathworks.com/help/deeplearning/ref/imagedataaugmenter.html
Hope this helps.
