Advanced Reinforcement Learning in Python: from DQN to SAC

Build Artificial Intelligence (AI) agents using Deep Reinforcement Learning and PyTorch: DDPG, TD3, SAC, NAF, HER.

This is the most complete Advanced Reinforcement Learning course on Udemy. In it, you will learn to implement some of the most powerful Deep Reinforcement Learning algorithms in Python using PyTorch and PyTorch lightning. You will implement from scratch adaptive algorithms that solve control tasks based on experience. You will learn to combine these techniques with Neural Networks and Deep Learning methods to create adaptive Artificial Intelligence agents capable of solving decision-making tasks.

What you’ll learn

Master some of the most advanced Reinforcement Learning algorithms..
Learn how to create AIs that can act in a complex environment to achieve their goals..
Create from scratch advanced Reinforcement Learning agents using Python’s most popular tools (PyTorch Lightning, OpenAI gym, Brax, Optuna).
Learn how to perform hyperparameter tuning (Choosing the best experimental conditions for our AI to learn).
Fundamentally understand the learning process for each algorithm..
Debug and extend the algorithms presented..
Understand and implement new algorithms from research papers..

Course Content

Introduction –> 4 lectures • 8min.
Refresher: The Markov Decision Process (MDP) –> 11 lectures • 32min.
Refresher: Q-Learning –> 5 lectures • 11min.
Refresher: Brief introduction to Neural Networks –> 7 lectures • 34min.
Refresher: Deep Q-Learning –> 4 lectures • 9min.
PyTorch Lightning –> 15 lectures • 1hr 20min.
Hyperparameter tuning with Optuna –> 6 lectures • 25min.
Deep Q-Learning for continuous action spaces (Normalized Advantage Function) –> 19 lectures • 1hr 17min.
Refresher: Policy gradient methods –> 5 lectures • 20min.
Deep Deterministic Policy Gradient (DDPG) –> 13 lectures • 1hr 9min.
Twin Delayed DDPG (TD3) –> 8 lectures • 31min.
Soft Actor-Critic (SAC) –> 8 lectures • 1hr 2min.
Hindsight Experience Replay –> 5 lectures • 26min.
Final steps –> 2 lectures • 2min.

Requirements

This course will introduce you to the state of the art in Reinforcement Learning techniques. It will also prepare you for the next courses in this series, where we will explore other advanced methods that excel in other types of task.

The course is focused on developing practical skills. Therefore, after learning the most important concepts of each family of methods, we will implement one or more of their algorithms in jupyter notebooks, from scratch.

Leveling modules:

– Refresher: The Markov decision process (MDP).

– Refresher: Q-Learning.

– Refresher: Brief introduction to Neural Networks.

– Refresher: Deep Q-Learning.

– Refresher: Policy gradient methods

Advanced Reinforcement Learning:

– PyTorch Lightning.

– Hyperparameter tuning with Optuna.

– Deep Q-Learning for continuous action spaces (Normalized advantage function – NAF).

– Deep Deterministic Policy Gradient (DDPG).

– Twin Delayed DDPG (TD3).

– Soft Actor-Critic (SAC).

– Hindsight Experience Replay (HER).

Get Tutorial

https://www.udemy.com/course/advanced-reinforcement/e1676bd24ed4f26da6dfdb9d5274227b5427af5c

Get Course

Join Telegram to Download More