Deep RL Bootcamp Lecture 5: Natural Policy Gradients, TRPO, PPO

Published 2017-10-05

Download video MP4 360p

Recommendations

44:45

Deep RL Bootcamp Lecture 6: Nuts and Bolts of Deep RL Experimentation
38:24

Proximal Policy Optimization (PPO) - How to train Large Language Models
25:21

L4 TRPO and PPO (Foundations of Deep RL Series)
41:22

L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)
59:36

Policy Gradient Theorem Explained - Reinforcement Learning
34:55

Deep RL Bootcamp Lecture 4B Policy Gradients Revisited
1:33:58

RL Course by David Silver - Lecture 7: Policy Gradient Methods
24:50

Overview of Deep Reinforcement Learning Methods
25:51

Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details
28:48

Trust Regions
19:50

An introduction to Policy Gradient methods - Deep Reinforcement Learning
1:16:10

L1 MDPs, Exact Solution Methods, Max-ent RL (Foundations of Deep RL Series)
45:49

DRL Lecture 1: Policy Gradient (Review)
58:12

MIT Introduction to Deep Learning | 6.S191
57:33

MIT 6.S191: Reinforcement Learning
35:35

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Similar videos

1:02:47

Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial
17:50

Proximal Policy Optimization Explained
49:43

Reinforcement Learning 8: Policy gradient methods
1:07:30

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)
13:18

Deep Policy Search Class: TRPO and PPO
1:07:31

lecture 15 natural policy gradient
More results