Further reading
For more information, refer to the following papers:
- Continuous Control with Deep Reinforcement Learning by Timothy P. Lillicrap, et al., https://arxiv.org/pdf/1509.02971.pdf
- Addressing Function Approximation Error in Actor-Critic Methods by Scott Fujimoto, Herke van Hoof, David Meger, https://arxiv.org/pdf/1802.09477.pdf
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor by Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine, https://arxiv.org/pdf/1801.01290.pdf