Efficiently Scaling and Deploying LLMs // Hanlin Tang // LLMs in Production Conference

Published 2023-05-19
