Developing and Serving RAG-Based LLM Applications in Production

Published 2023-10-12
