Efficiently Scaling and Deploying LLMs // Hanlin Tang // LLM's in Production Conference
Published 2023-05-19