2019
https://blog.janestreet.com/l2-regularization-and-batch-norm/
https://machinelearningmastery.com/batch-normalization-for-training-of-deep-neural-networks/

2018
https://machinelearningmastery.com/difference-between-a-batch-and-an-epoch/
https://blog.paperspace.com/busting-the-myths-about-batch-normalization/

2016
Goodfellow - Ch 9: Convolutional Networks
http://www.deeplearningbook.org/slides/dls_2016.pdf
https://www.youtube.com/watch?v=Xogn6veSyxA
https://www.reddit.com/r/MachineLearning/comments/67gonq/d_batch_normalization_before_or_after_relu/
https://arxiv.org/pdf/1502.03167.pdf
https://arxiv.org/pdf/1805.11604.pdf