Reinforcement Learning: ChatGPT and RLHF

Published 2023-08-14
