Debugging Neural Nets (for NLP) (2/15/2018)
- Identifying problems
- Debugging training time problems
- Debugging test time problems
- Interesting Reading
- Reference: Highway Networks (Srivastava et al. 2015)
- Reference: Residual Connections (He et al. 2015)
- Reference: Rethinking Generalization (Zhang et al. 2017)
- Reference: Marginal Value of Adaptive Gradient Methods (Wilson et al. 2017)
- Reference: Adam w/ Learning Rate Decay (Denkowski and Neubig 2017)
- Reference: Dropout (Srivastava et al. 2014)
- Reference: Recurrent Dropout (Gal and Ghahramani 2015)
- Reference: Minibatch Creation Strategies (Morishita et al. 2017)
- Reference: Decoding Problems (Koehn and Knowles 2017)
Slides: Debugging Slides