Mastering RLHF with AWS: A Hands-on Workshop on Reinforcement Learning from Human Feedback

Published 2023-08-03

Download video MP4 360p

Recommendations

58:07

Aligning LLMs with Direct Preference Optimization
10:48

RLHF+CHATGPT: What you must know
1:03:32

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
49:07

[Webinar] LLMs for Evaluating LLMs
59:11

Deep Dive into LLM Evaluation with Weights & Biases
1:16:15

Stanford CS224N | 2023 | Lecture 10 - Prompting, Reinforcement Learning from Human Feedback
1:01:12

Building Multi-Modal Search with Vector Databases
1:00:18

Prompt-Engineering for Open-Source LLMs
52:21

Navigating LLM Threats: Detecting Prompt Injections and Jailbreaks
1:03:07

Practical Data Science on AWS: Generative AI
59:53

Efficient Fine-Tuning for Llama-v2-7b on a Single GPU
59:35

Building with Instruction-Tuned LLMs: A Step-by-Step Guide
1:00:38

Reinforcement Learning from Human Feedback: From Zero to chatGPT
1:02:12

How to Build, Evaluate, and Iterate on LLM Agents
1:02:38

AI Safety, RLHF, and Self-Supervision - Jared Kaplan | Stanford MLSys #79
1:31:13

A Hackers' Guide to Language Models
15:46

Introduction to large language models

Similar videos

1:11:49

RLHF - Reinforcement Learning with Human Feedback
1:00:38

Reinforcement Learning from Human Feedback From Zero to ChatGPT [Record of the live]
06:31

Reinforcement Learning: ChatGPT and RLHF
03:34

What is Reinforcement Learning with Human Feedback (RLHF) ?
12:38

Reinforcement Learning from Human Feedback (RLHF)
51:09

Getting Started with Reinforcement Learning with Human Feedback | Workshop Recap
47:16

Nathan Lambert - Reinforcement Learning from Human Feedback @ UCL DARK
59:36

Alignment: Fine-Tuning with RLHF (Reinforcement Learning with Human Feedback)
00:40

Reinforcement Learning from Human Feedback
08:13

Reinforcement Learning from Human Feedback (Natural Language Processing at UT Austin)
01:47

Unlock the Power of Generative AI with RLHF Powered by Appen
1:18:37

Generative AI: PEFT and RLHF workflows + Polars for blazing-fast dataframes in Ray and beyond
More results