Democratizing Foundation Models via k-bit Quantization - Tim Dettmers | Stanford MLSys #82
Published 2023-10-23