profile pic
⌘ '
raccourcis clavier

usually prone to overfitting given they are often over-parameterized

  1. We can usually add regularization terms to the objective functions
  2. Early stopping
  3. Adding noise
  4. structural regularization, via adding dropout

dropout

a case of structural regularization

a technique of randomly drop each node with probability pp