Impact-Site-Verification: dbe48ff9-4514-40fe-8cc0-70131430799e

Search This Blog

Reinforcement Learning for Engineers, Part 3: Policies and Learning Algorithms

This video provides an introduction to the algorithms that reside within the agent. We’ll cover why we use neural networks to represent functions and why you may have to set up two neural networks in a powerful family of methods called actor-critic.

Reinforcement Learning by Sutton and Barto:
RL course by David Silver:
3B1B videos on neural networks:
Michael Neilson’s blog: Follow up:
A nice write up on policy gradient algorithms and the policy gradient theorem by Lilian Weng:
A paper motivating the usefulness of deep neural networks:
Brendan Fortuner’s blog:

Part 1: What Is Reinforcement Learning?

Part 2: Understanding the Environment and Rewards

Part 4: The Walking Robot Problem

Part 5: Overcoming the Practical Challenges

No comments