Jimmy Shen

Screenshot from [1]

Based on [1], RL algorithms can be grouped into three categories:

  • Value-based
  • Policy-based
  • Both value-based and policy-based
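
To make the difference between the first two categories concrete, here is a minimal sketch in Python. It is only an illustration under my own assumptions, not something taken from [1]: the single state, the three actions, and the numbers in `q_values` and `policy_logits` are all made up, and no learning happens here. It only shows how an action is chosen once the values or the policy parameters are given.

```python
import math
import random

# Hypothetical toy setup (not from [1]): one state, three actions.
q_values = [0.1, 0.5, 0.2]        # value-based: estimated action values Q(s, a)
policy_logits = [0.0, 1.0, -1.0]  # policy-based: policy parameters for the same state


def value_based_action(q):
    """Act greedily: pick the action with the largest estimated value."""
    return max(range(len(q)), key=lambda a: q[a])


def softmax(logits):
    """Turn logits into a probability distribution over actions."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_based_action(logits, rng):
    """Sample an action directly from the parameterized policy pi(a | s)."""
    probs = softmax(logits)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
greedy_action = value_based_action(q_values)
sampled_action = policy_based_action(policy_logits, rng)
action_probs = softmax(policy_logits)
```

In the value-based case the policy is implicit: it is derived from the value estimates. In the policy-based case the policy itself is what gets parameterized and sampled from. An algorithm in the third category would keep both pieces, for example using the value estimates to guide how the policy parameters are updated.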

Some nice books and tutorials can be found in [2][3][4].

This article is not complete yet; I will keep updating it. If you have any suggestions, please leave a comment. Thanks!


[1] Lihong Yi video…



I have struggled with paper writing for a long time, especially with the abstract. Professor Liang Huang's lecture on how to write up papers helped me a lot. The lecture notes can be found here. In this article, I focus on how to write a paper abstract.

This screenshot is from the slides here.

In the…