bias variance tradeoff

Three models of fitting. Consider trying to fit some dataset \(|D|= n\) that’s roughly quadratic with…

  • a linear model: underfit, high bias (i.e. “model imposes bias of linearity on data”)
  • a nth order polynomial: overfit, high variance (i.e. “a small perturbation of data brings lots of change”)

Its important to pay attention if you are having high bias of high variance—solutions of each is different from each other.

intuition of overfitting

See overfit

diagnosing bias variance tradeoff problems

Let’s consider:

ReferenceTrainingTestJudgment
2%2%2%High Bias
10%2.5%10%High Variance
10%10%20%Both High Bias and Variance