Impact-Site-Verification: dbe48ff9-4514-40fe-8cc0-70131430799e

Search This Blog

DQN Control for Inverted Pendulum with Reinforcement Learning Toolbox

 Use the Deep Q-Network (DQN) algorithm in Reinforcement Learning Toolbox™ to:

1) Create the environment

2) Create DQN agent

3) Customize policy representation

4) Train DQN agent

5) Verify trained policy

6) Deploy trained policy with code generation


The provided pendulum environment has predefined observations, actions, and reward. The actions include five possible torque values, while the observations include a 50x50 grayscale image as well as the angular rate of the pendulum, and the reward is the distance from the desired upward position. See how the default DQN agent feature automatically constructs a neural network representation of the Q-function, used by the DQN agent to approximate long-term reward. Learn how to use Deep Network Designer app to graphically customize the generated Q-function representation.


See how you can visualize the pendulum behavior and logged data during training, and monitor training progress. After training is complete, verify the policy in simulation to decide if further training is necessary. If you are happy with the design, deploy the trained policy using automatic code generation.


00:00 Introduction

00:33 Load Predefined Environment

01:49 Create Default DQN Agent

03:39 Construct Custom DQN Critic Network

04:26 Train DQN Agent

06:13 Simulate & Validate Performance of DQNAgent

06:29 Deployment & Code Generation of DQN Agent

06:35 Conclusion


No comments

Popular Posts