Proximal Policy Optimization in Reinforcement Learning
Proximal Policy Optimization (PPO) is a popular reinforcement learning algorithm. In this article, I collect some tutorials that I found helpful while learning it.
A general RL tutorial summary can be found here
Please check . In case you understand Chinese, there are some tutorials in .
Further reading about the first author of the PPO algorithm
Of course, depending on your background, you may have different requirements for your learning process. If you want to understand the basics and know how to use the PPO algorithm, the material here is pretty much enough. However, if you want to do research, I highly recommend reading the papers and blogs from . They are really helpful.
Proximal Policy Optimization Algorithms paper
PPO source code reading: https://blog.csdn.net/jinzhuojun/article/details/80417179
First author: John Schulman
First author’s advisor: Pieter Abbeel https://people.eecs.berkeley.edu/~pabbeel/