Summary of “A Bayesian Perspective on generalization and stochastic gradient descent, ICLR 2018” | by Jimmy (xiaoke) Shen | Medium

Summary of “A Bayesian Perspective on generalization and stochastic gradient descent, ICLR 2018”
Jimmy (xiaoke) Shen
·Follow
1 min read·
Mar 12, 2020
--
This paper is a follow-up paper of “UNDERSTANDING DEEP LEARNING REQUIRES RETHINKING GENERALIZATION, ICLR 2017”,
Important/Interesting observations from the paper“We also demonstrate that, when one holds the learning rate fixed, there is an optimum batch size which maximizes the test set accuracy.”
--
--
Written by Jimmy (xiaoke) Shen306 Followers
·351 Following
MLE/SWE @meta
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams