Poke | Advanced Actor Critic and Policy Gradient Methods

12 videos • 13,401 views • by Machine Learning with Phil

In this series of deep reinforcement learning tutorials, you will learn how to apply advanced actor critic methods to environments from the OpenAI Gym with continuous action spaces. You will read and implement the original deep deterministic policy gradients (DDPG) paper. We'll also cover how to handle multithreaded processing in Python with the asynchronous advantage actor critic (A3C) algorithm. We then move on to more advanced topics such as proximal policy optimization (PPO), twin delayed deep deterministic policy gradients (TD3), and soft actor critic (SAC). Tutorials are presented in both the PyTorch and TensorFlow deep learning frameworks.

Multicore Deep Reinforcement Learning | Asynchronous Advantage Actor Critic (A3C) Tutorial (PYTORCH)

Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG) | Tensorflow 2 Tutorial

How to Implement Deep Learning Papers | DDPG Tutorial

Reinforcement Learning in Continuous Action Spaces | DDPG Tutorial (Tensorflow)

Reinforcement Learning in Continuous Action Spaces | DDPG Tutorial (Pytorch)

Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial