Maxton‘s Blog

Blog Docs Links About Timeline 中文

Back

Tags: #policy gradient

Feb 22, 2026

RL Study Notes: Policy Gradient Methods

Core concepts of RL policy gradient methods: objective functions, the log-derivative trick, theorem derivation, and the REINFORCE algorithm.

6 min English