Overfitting: Difference between revisions

From Computer Science Wiki
No edit summary
No edit summary
 
Line 3: Line 3:
== Introduction ==
== Introduction ==


Overfitting is when a statistical model takes too many parameters. (continue general meaning)
Overfitting is a phenomenon that occurs when a machine learning model is trained too well on the training data. This can cause the model to perform poorly on unseen data, because it has learned patterns in the training data that do not generalize to new data.
 
In Machine Learning, overfitting is when you take into account too many random factors. The problem lies within the random factors, as there is always some ''noise''. (reference to define noise). When the AI overfits, it ends up learning from the noise, which might make decision making a lot worse.


In overfitting, the model becomes too complex and fits the noise in the training data rather than the underlying relationship. This can lead to poor generalization performance, as the model will be sensitive to the noise in the training data and may not be able to generalize to unseen data.


One way to mitigate overfitting is to use techniques such as regularization, which helps to constrain the model and reduce its complexity. Other techniques include using a larger training dataset, using cross-validation to tune the model, and early stopping to prevent the model from becoming too complex.


== References ==
== References ==

Latest revision as of 08:12, 7 January 2023

Case study notes[1]

Introduction[edit]

Overfitting is a phenomenon that occurs when a machine learning model is trained too well on the training data. This can cause the model to perform poorly on unseen data, because it has learned patterns in the training data that do not generalize to new data.

In overfitting, the model becomes too complex and fits the noise in the training data rather than the underlying relationship. This can lead to poor generalization performance, as the model will be sensitive to the noise in the training data and may not be able to generalize to unseen data.

One way to mitigate overfitting is to use techniques such as regularization, which helps to constrain the model and reduce its complexity. Other techniques include using a larger training dataset, using cross-validation to tune the model, and early stopping to prevent the model from becoming too complex.

References[edit]